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We consider recent reports on small-world topologies of interaction networks derived from the dynamics of 
spatially extended systems that are investigated in diverse scientific fields such as neurosciences, geophysics, 
or meteorology. With numerical simulations that mimic typical experimental situations we have identified 
an important constraint when characterizing such networks: indications of a small-world topology can be 
expected solely due to the spatial sampling of the system along with commonly used time series analysis 
based approaches to network characterization. 
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The complex dynamics of spatially extended nat- 
ural systems such as the human brain or the cli- 
mate and the earth system is notoriously difficult 
to understand and has repeatedly stimulated a 
variety of scientific effbrts, among them the de- 
velopment of sophisticated data analysis meth- 
ods. Concepts from network theory promise to 
improve our understanding of such complex sys- 
tems. These approaches consider a system as 
being composed of dynamically interacting sub- 
systems whose functional interdependencies are 
refiected as links in an interaction network. In- 
teraction networks derived from field data of dif- 
ferent dynamical systems have been consistently 
reported to possess small-world characteristics — 
high local clustering and small average shortest 
path length — which is considered as an indica- 
tion of a common organization principle of natu- 
ral dynamical systems. However, little attention 
has so far been paid to the conditions and as- 
sumptions underlying such analysis approaches. 
We here demonstrate that indications of small- 
world characteristics of interaction networks can 
solely be expected due to the spatial sampling of 
dynamical systems together with commonly used 
time series analysis techniques for the character- 
ization of network links. Our findings not only 
call for the development and use of methods tak- 
ing into account the spatial sampling of the stud- 
ied systems but also for a careful interpretation 
and re-consideration of analysis results obtained 
so far. 
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I. INTRODUCTION 

Over the last decade, network theory has contributed 
significantly to improve our understanding of complex 
systems, with wide applications in diverse fields, rang- 
ing from physics to biology and medicine^^^. With the 
introduction of the small- world network model by Watts 
and Strogatz^ research into complex networks has gained 
strong momentum. Since then, numerous studies have 
shown that a wide range of real world networks can be 
regarded as small-world networks (SWN) that are char- 
acterized by high levels of local clustering among nodes 
and by short paths that globally link nodes. Global and 
local network properties can be assessed with different 
measures^ among which the average shortest path length 
L and the clustering coefficient C are widely used in field 
studies. L is the average shortest distance between any 
pair of nodes, and C characterizes the local interconnect- 
edness of nodes^. Large values of both L and C are typi- 
cal for lattices while low values of L and C are found for 
random networks. A low value of L (or, from a network 
theoretic point of view, L scaling at most logarithmically 
with the number of nodes A^) and a high value of C is 
usually considered as indicative for SWN"*. 

Representing a complex system as a network requires 
identification of nodes and links which can be achieved 
straightforwardly in many scientific fields. However, 
when investigating the dynamics of spatially extended 
systems in terms of complex networks - such as in the 
neurosciences, in geophysics, or in meteorology - identi- 
fication of nodes and links is a challenging issue. Nodes 
are usually assumed to represent distinct subsystems 
and links represent interactions between them, and these 
nodes and links constitute an interaction network. Lack- 
ing explicit knowledge of the structural organization of 
the dynamical system, nodes are usually associated with 
sensors that are placed so as to sufficiently capture the 
subsystems' dynamics, thereby considering theorems for 
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an appropriate spatial and temporal sampling. While the 
latter is usually not an issue, choosing the right num- 
ber of sensors and placing them in a meaningful way is 
highly nontrivial. When characterizing links, one is often 
faced with the problem that the underlying equations of 
motion are not known or that interactions between sub- 
systems can not directly be measured experimentally. In 
these cases, time scries analysis techniques are employed 
that aim at quantifying linear or nonlinear interdepen- 
dencies between observables of subsystems recorded at 
the sensors Eventually, weighted or binary (e.g., 
via thresholding), mostly undirected networks are con- 
structed, and inference about small- world characteristics 
is usually based on a comparison of experimentally de- 
rived values of L and C to those of corresponding ran- 
domized networks. Using this approach small- world char- 
acteristics have been repeatedly reported for brain func- 
tional networks under both physiologic and pathophys- 
iologic conditions^^'^^, and for seismic^'^'^^ and climate 
networks^^"^''. Despite the use of highly sophisticated 
analysis techniques, the aforementioned challenging is- 
sue of properly identifying nodes and links of interaction 
networks remains largely unsolved. 

We here demonstrate - by simulating typical exper- 
imental situations in the diverse scientific fields - that 
indications of a small-world topology can be expected 
solely due to the spatial sampling together with influ- 
encing factors that can be attributed to the widely used 
data-driven approaches to link and network characteri- 
zation. 



II. INFERRING AND CHARACTERIZING 
INTERACTION NETWORKS FROM FIELD DATA 

We begin with recalling approaches that have been em- 
ployed very frequently to derive and analyze interaction 
networks. First, interdepcndencies between subsystems 
are estimated using some bivariate time series analysis 
technique. Among the many available methods''"^°'^^"^° 
particularly symmetric techniques that aim at assessing 
the strength of interaction between time series i and 
j are often used to derive undirected binary networks. 

Second, from matrix M with entries m.y = rriji the 
adjacency matrix A representing the assumed underlying 
interaction network is derived by thresholding. Let N 
denote the number of nodes of the interaction network 
and let ki denote the degree of node i, i.e., the number of 
nodes to which node i is connected. In most field studies. 



the mean degree k = 7V~^ ki is chosen and entries Uij 
of A are set to 1 for all kN largest entries rriij (resulting in 
kN/2 undirected links), and a^- = (no link) otherwise. 

Third, clustering coefficient C and average shortest 
path length L are determined to characterize the interac- 
tion network and to assess its possible small-world char- 
acteristics. The local clustering coefficient Ci quantifies 
the local interconnectedness of the network and is defined 
as the fraction of the number of existing links between 



neighbors of node i among all possible links between these 
neighbors'^'^'^ 



0, 



if h e {0, 1}. 
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The clustering coefficient C of the network is then defined 
as the mean of the local clustering coefficients 
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and represents a local scale property of the network. Note 
that Ci,C e [0,1]. 

The average shortest path length represents a global 
scale property of a network and is defined as the average 
distance between any two nodes. 
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where Uj denotes the length of the shortest path between 
node i and j. Note that some authors include the dis- 
tance from each node to itself in the average (as we do 
here, la = 0), while others exclude it"^. Exclusion will, 
however, only change the value of L by the constant fac- 
tor of {N + ^)/{N — 1). An issue which may be encoun- 
tered when determining L for interaction networks de- 
rived from field data are pairs of nodes which do not pos- 
sess a connecting path, in which case /y = oo. To avoid 
this problem, it has been proposed to replace /y in Eq. 
3 with resulting in a quantity called efficiency^^'^^. 
Another approach, which is often used in field studies, is 
to exclude infinite lij from the average in Eq. 3. We here 
follow this ansatz and define the average shortest path 
length L, 
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where 



S = {(«,.?) I hj < oo; i,j = 1,...,A^} 



(4) 



(5) 



denotes the set of all pairs of nodes which are con- 
nected through some path with finite path length lij , and 
where Ni =\ S \ denote the number of such pairs. 

From a network theoretic point of view^'^'^, L scaling at 
most logarithmically with N, and a high value of C is con- 
sidered the hallmark of SWN topology. This approach, 
however, comes along with a requirement that can hardly 
be fulfilled in typical experimental situations. In some 
cases it is impossible to vary the number of sensors (i.e. 
number of nodes) due to experimental restrictions, while 
in other cases it might be the nature of the system itself 
that does not allow the recording of observables from an 
arbitrary number of sensors. Moreover, even if it were 
possible to vary the number of sensors, this would re- 
quire choosing which sensor to remove or where to add 
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an additional sensor — a choice that is highly dependent 
on a priori knowledge of the spatial organization of the 
dynamical system. It is thus not astonishing, that field 
studies do not evaluate a possible scaling behavior of L. 
Instead, in the fourth step values of C and L are typically 
compared with values (Cr) and (L,) obtained from en- 
sembles of the corresponding random networks^^'^^ that 
are generated by randomizing the original network while 
preserving the degree of each node. Eventually, values of 
7 := C/(Ci.) > 1 and A := L/ {L^) « 1 are assumed to be 
indicative of SWN. 



An illustrative example from field data analysis 

Using the aforementioned approaches for inferring and 
characterizing interaction networks we analyzed time se- 
ries of brain magnetic activities (magnetoencephalogra- 
phy; MEG^^) that were recorded (sampling rate: 254.31 
Hz: 16 bit A/D conversion; bandwidth: 0.1-50 Hz; 148- 
channel magnetometer system, cf. Fig. lA)) from a 
healthy subject with eyes closed^*" (the subject had signed 
informed consent that the data might be used and pub- 
lished for research purposes, and the study was approved 
by the local medical ethics committee). Here we restrict 
ourselves to N ~ 130 sensors in order to minimize con- 
taminations with muscle activity in the lowermost ring of 
sensors. As an interdependence measure, here we choose 
the absolute value of the correlation coefficient, 



Pij 



T- 
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where Xi(t) and Xj(t) denote MEG time series of length 
T {T = 4096 data points) recorded at sensors i and j, 
respectively, and (x) and a denote the respective mean 
values and standard deviations. In Fig. IB we show 
the matrix M with entries = pij. We derive the 
adjacency matrix A (with entries a^j) of the interaction 
network by choosing a mean degree k and thresholding 
as described above. A displays a pattern of diagonals, 
which can also be found in M and can be attributed to 
spatially close sensors (cf. Fig. lA). For A we obtain 
C = 0.58 and L = 3.13. As in many field studies, we 
proceed by comparing values of C and L with those of 
100 realizations of corresponding random networks^^'^^ 
and assume 7 > 1 and A « 1 to be indicative of SWN. 
With 7 = 4.21 ±0.15 and A = 1.53±0.01 this interaction 
network would have been interpreted as SWN. 

Let us now consider some interdependence measure rh 
that depends on the Euclidean distance in 3-dimensional 
space dij between sensors only. We assume m to de- 
crease strictly monotonically with dij such that m from 
spatially close sensors will attain larger values than rh 
from spatially distant sensors. This leads to a network 
with distance-dependent connectivity structure, an ex- 
ample of a spatial network"*'^*. In Fig. IC we show that 
both M and A display characteristic diagonal patterns 
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FIG. 1. (Color online) (A) Schematic of the spatial arrange- 
ment of a subset of sensors used to sample the dynamics of 
a human brain by MEG. (B) Left: Exemplary matrix M 
where entry rriij — rriji is the absolute value of the corre- 
lation coefficient between MEG time series Xi{t) and Xj{t) 
from sensor pair (i,j). Right: Adjacency matrix A derived 
from M by thresholding with k = 15. (C) Left: Matrix M 
with entries rhij — F{dij), where dij denotes the Euclidean 
distance between sensors i and j in 3-dimensional space, and 
F{dij) = (1 + exp{u{dij — v)))~^ with u — 23 and v — 0.1. 
Right: A derived from M by thresholding with k — 15. Note 
that A is not afi'ected by the choice of F, as long as F de- 
creases strictly monotonically with increasing dij. Entries of 
all matrices range from (black) to 1 (white). 



already observed in the respective matrices derived from 
field data. For A we obtain C = 0.57 and L = 3.14, 
and with 7 = 4.97 ± 0.18 and A = 1.55 ± 0.01 even this 
network would have been interpreted as SWN. 



III. INVESTIGATION OF INFLUENCING FACTORS 

There are a number of influencing factors that can ad- 
versely affect the assessment of small-world characteris- 
tics of interaction networks derived from field data. In 
addition to the unavoidable imprecision of the data ac- 
quisition, the accuracy of some interdependence measure 
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FIG. 2. (Color online) Top: Mean values of normalized clus- 
tering coefficient 7 (left) and normalized average shortest path 
length A (right) for two-dimensional square lattices with dif- 
ferent numbers of nodes A'^ and mean degrees k (maximum 
standard deviations: a-y = 0.02 and a\ = 0.02). White crosses 
mark (A'^, k) configurations for which lattices will be classified 
as SWN if 7 > 2 and A < 2 is chosen as a practical crite- 
rion. Bottom: Minimum fraction of randomly replaced links 
K for which the resulting network would be classified as SWN 
(A < 2) in dependence on the density of links ^. Error bars 
denote standard deviations derived from 100 independent re- 
placement runs, and lines are for eye guidance only. Note 
that error bars are smaller than symbol size in the majority 
of cases. 



used to characterize links is usually restricted due to the 
limited amount of accessible data and is spoiled due to 
unavoidable noise contributions. Together with thresh- 
olding methods for deriving interaction networks — for 
which the mean degree is often chosen empirically — this 
may lead to spurious additional links present in the net- 
work as well as to spuriously missing links. Thus, the 
question arises as to how reliable do we have to esti- 
mate links in order to safely infer small-world charac- 
teristics of interaction networks from field data? More- 
over, strongly interdependent signals may either reflect 
some functional interaction between different subsystems 
or may be caused by sampling the same subsystem, but 
most time series analysis techniques do not allow one 
to unequivocally distinguish between both cases. How 
does this affect the inference of small-world character- 
istics of interaction networks, even in cases where such 
a distinction was in principle possible? In the following 
paragraphs we address these questions in more detail. 



Impact of measurement uncertainties and latticelike 
arrangement of sensors 

We assume N sensors to be arranged on a square- 
lattice - which is quite typical for field studies - and 
associate sensors with nodes. Taking into account that 
interactions between spatially neighbored nodes are usu- 
ally stronger than spatially distant nodes (spatial cor- 
relations), we assume an interdependence measure to 
decrease strictly monotonically with an increasing Eu- 
clidean distance between sensors only. We derive links 
via thresholding with mean degree k. Note, that the 
derived networks possess lattice topologies due to the 
construction scheme. We chose values of N and k as 
typically reported in field studies and estimate 7 and A 
for the derived networks as above. For a range of {N, k) 
values (cf. Fig. 2, upper part), we observe 7 1 and 
A « 1, which would indicate these networks to possess 
small- world characteristics. For parameters from the up- 
per right region of the (A'', k) plane, networks would not 
be classified as SWN since 7 approaches 1 with an in- 
creasing density of links ^ := k/{N — 1). For sparse net- 
works (lower left region of the (iV, k) plane) we observe 
A ^ 1 for a range of {N, k) values, which also would not 
indicate SWN. For these sparse networks, however, the 
reliability of estimating links is of crucial importance for 
a correct classification. 

In principle, a limited reliability of link estimation will 
lead to spurious additional links and to spurious missing 
links contributing to the topology of the network. Con- 
trolling the amount of spurious links could be achieved 
by multiple testing against some appropriately chosen 
null model^^^'^''. Such an approach allows to control 
the probability of erroneously detecting links (false pos- 
itives), but it is well known for its limited power which 
leads to a starkly increased number of false negatives 
(missing links). In addition, taking into account prob- 
lems of defining appropriate null models for time series 
(i.e. surrogates'^^) and the computational burden of gen- 
erating them, multiple testing procedures have not found 
wide applicability in studies of interaction networks de- 
rived from field data. Nevertheless, we can carry over 
concepts from multiple testing methods to learn about 
the reliability needed to correctly infer from 7 and A 
that the networks in the lower left corner of the {N, k) 
parameter space (cf. Fig. 2 upper part) possess lattice 
topologies. For these sparse networks we model uncer- 
tainties from estimating links (i.e., the number of false 
positives) by randomly replacing links in the networks. 
With rir we denote the average minimum number of ran- 
domly replaced links that would lead to a classification 
of the original lattice as SWN due to a decrease of the 
average shortest path length such that 1 « A < A. As a 
practical criterion we here exemplarily set A = 2. In the 
lower part of Fig. 2 we show that a minimum fraction of 
randomly replaced links k := 2nr/{kN) of less than 2 % 
would suffice to falsely classify these sparse networks as 
SWN (note that k represents the false discovery rate'^^ in 
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FIG. 3. Left: Local clustering coefficient C* of node i of A/"* 
as a function of d of f\f for different node degrees ki. Con- 
struction of J\f* is shown schematically in the inset. Nodes 
and links included in M and A/"* are colored black, while nodes 
and links included in A/"* only are colored gray. Right: Means 
of C{p) ■- C(p)/C(0) (open symbols) and L{p) ■- L{p)/L{0) 
(filled symbols) for Af depending on the rewiring probability 
p (lines are for eye guidance only). C*{p) and L*{p) denote 
the corresponding quantities for A/"*. We used the Watts- 
Strogatz scheme (A'^ = 1000, = 4, 1000 realizations for each 
p) to generate M networks (symbol A) and derived N* net- 
works (symbol v) by duplicating all nodes from M. Standard 
deviations for all quantities are smaller than symbol size. 



the context of multiple testing procedures). The replace- 
ment of links affects 7 only marginally, and we always 
observed 7^1. Note that k even decreases with an in- 
creasing density of links ^. Particularly for networks with 
a small number of nodes and depending on the chosen 
mean degree we observe that only one to five randomly 
replaced links lead to A < A. We note that together with 
theoretical arguments'^'^"'^^ our findings are decisive for a 
characterization of networks derived from field data. By 
construction, the average shortest path length L depends 
sensitively on the actual link structure and changing or 
adding only a few links can result in a remarkable change 
of L. 

Summarizing, under unfavorable conditions the often 
used latticelike arrangement of sensors (we note that 
we obtained similar findings for three-dimensional lat- 
tices) together with a limited reliability of estimating 
links leads to indications of a small-world topology of 
interaction networks derived from the dynamics of spa- 
tially extended systems, even if the underlying interac- 
tion structure is not SWN. 



Impact of common sources 

We now consider time series analysis techniques which 
can not distinguish between interdependeneies due to 
functional interactions between different subsystems and 
interdependeneies caused by sampling the same subsys- 
tem (i.e., a common source). We address the question of 
how these techniques affect the inference of small-world 
characteristics. We assume that a given system can be 
regarded as some network M with some topology and 



that M consists of A'' nodes and some number of links. 
Ideally, one would sample the system by placing N sen- 
sors such that each captures the dynamics of the respec- 
tive subsystem. In field studies, however, the number 
of subsystems reflecting the dynamical organization of 
TV is usually not known a priori and we would choose 
the number of sensors such that they allow a reasonably 
dense spatial sampling. This might come along with the 
risk of sampling the dynamics of the same subsystem 
by two or more sensors. With the time series analysis 
techniques we have in mind (such as cross correlation, 
coherency, mean phase coherence, mutual information), 
this will lead to an indication of strongly interdependent 
time series. We simulate this situation by introducing for 
each ideally placed sensor i {i G 1, . . . ,N) another sen- 
sor i' with zero spatial distance between i and i' . This 
leads to a network TV* possessing N* — 2N nodes. Each 
duplicate node i' inherits the same neighbors of i and is 
connected to i (cf. inset in left part of Fig. 3). We derive 
C* and L* of TV* as functions of d and L of TV as 

C* = I 2fef+l + 24+1 ' ''^^^i> ^ /y^, 

' \ 0, iffc, =0 ^' 

L*=L + L+ withL+ = ^, (8) 

where ki and TV; are quantities of TV. Note that g 
Iw' 5] '^here the lower bound holds for connected net- 
works (a path exists between every pair of nodes) and 
the upper bound for networks without links. Obviously 
the impact of introducing additional nodes (i.e., sensors) 
on the average shortest path length can be neglected, 
since L* k, L. In contrast, the clustering coefficient 

C* = (A^*"^E^*iC'*) > C' increases, since C* > C, 
(ef. Fig. 3 left). In the right part of Fig. 3 we demon- 
strate this effect for different network topologies of TV 
that we derived by employing the construction scheme 
proposed by Watts and Strogatz^. For all rewiring prob- 
abilities p we observe I/*(p)/L*(0) w L{p)/L{0), however, 
C*(p)/C*(0) clearly exceeds C(p)/C(0) with increasing 
p such that even TV* derived from random networks Af 
{p ~ 1) would have been characterized as SWN. 

We now evaluate the opposite situation and consider 
time series analysis techniques, which we assume to be 
able to distinguish between interdependeneies due to 
functional interactions between different subsystems and 
interdependeneies due to a common source. We proceed 
as above and introduce for each ideally placed sensor i 
(i g 1, . . . , TV) another sensor i' with zero spatial distance 
between i and i' , which again leads to a network Af* pos- 
sessing N* ~ 2N nodes. Each duplicate node i' inherits 
the same neighbors of i but — different to the simulation 
before — this duplicate node is not connected to i (cf. 
inset in left part of Fig. 4). We derive C* and L* of Af* 
as functions of Ci and L of TV as 
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FIG. 4. Left: Local clustering coefficient C* of node i of A/"* 
as a function of d of A/" for different node degrees ki. Con- 
struction of J\f* is shown schematically in the inset. Nodes 
and links included in TV and A/"* are colored black, while nodes 
and links included in A/"* only are colored gray. Right; Means 
of C{p) ■- C(p)/C(0) (open symbols) and L{p) ■- L{p)/L{0) 
(filled symbols) for A/" depending on the rewiring probability 
p (lines are for eye guidance only). C*{p) and L*{p) denote 
the corresponding quantities for A/"*. We used the Watts- 
Strogatz scheme (A'' = 1000, = 4, 1000 realizations for each 
p) to generate A/" networks (symbol A) and derived A/"* net- 
works (symbol v) by duplicating all nodes from A/". Standard 
deviations for all quantities are smaller than symbol size. 



employing the Watts-Strogatz construction scheme. For 
all rewiring probabilities p we observe L*{p)/L*{0) w 
L(p)/L(0) and C*{p)/C*{0) < C{p)/C{0). Thus, J\f* 
derived from random networks J\f (p = 1) would not 
be characterized as SWN but as random networks when 
employing time series analysis techniques, which can un- 
equivocally distinguish between interdependencies due to 
a common source and interdependencies due to func- 
tional interactions between different subsystems. 

To our knowledge, only few time series analysis tech- 
niques have been published that address the problem of 
interdependencies due to sampling the same (or nearly 
the same) dynamical subsystem'^^''^^. These techniques 
are based on the assumption that common components 
in both time series stemming from the same dynamical 
subsystem will lead to instantaneous interdependencies 
(with zero time lag). Separating these instantaneous in- 
terdependencies from those associated with a non-zero 
time lag could lead to techniques capturing interdepen- 
dencies between different interacting dynamical subsys- 
tems only. We note, however, that these techniques have 
not yet found broad application in field studies. 



c: = a^^, (9) 

L* = l*L + Lt, (10) 

where 

A^o is the number of nodes in Af with no neighbors, i.e. 
No =\ {i \ k, ^ 0,i = 1,...,N} \. Note that /* £ [1,2] 
where the upper bound holds for the special case of net- 
works M without links {Nq = N) and the lower bound 
for networks where each node possess at least one link 
(iVo = 0), which, e.g., is the case for connected networks. 
L2 G [0, ^] where the lower bound holds for networks 
without links and is approached by connected networks 
(Lj = iV-i) and the upper bound is approached by the 
special case of networks with decreasing A^o and increas- 
ing number of connected components and reached for 
N/2 connected components and A*o = 0. Again, the im- 
pact of introducing additional nodes (i.e., sensors) on the 
average shortest path length can be neglected in networks 
possessing links {L* w L). The clustering coefficient C* 
decreases since C* < Ci depending on the corresponding 
degrees of the nodes (cf. Fig. 4 left). Note, however, that 
the maximum possible reduction amounts to C* = ^Ci 
(ki — 2) only and that C* = Ci for ki G {0, 1} together 
with C* Ci for increasing ki will likely cause only a 
slight decrease of C* in real world networks. 

In the right part of Fig. 4 we present our findings for 
different network topologies of Af* that we derived by 



IV. DISCUSSION 

Several influencing factors may hamper the unequivo- 
cal inference of small- world characteristics in interaction 
networks derived from field data using well accepted time 
series analysis techniques together with network theo- 
retic approaches. Taken together, these infiuences will 
most likely lead to findings of small- world characteristics 
in interaction networks irrespective of their actual orga- 
nization. These influences may be associated with four 
aspects which we discuss in the following. 

First, the commonly used comparison of values of C 
and L with those obtained for corresponding random 
networks comes along with the risk of classifying even 
an actual lattice as SWN (cf. Fig. 2 top). The reason 
is, that such a comparison can only provide clues as to 
how much the topology of the network under study dif- 
fers from that of a random network but not from that of a 
lattice. A comparison with lattices has been proposed'^* 
but has not yet been found wide application. The latter 
can be attributed to the fact, that when comparing with 
lattice topologies one has to decide upon the dimension- 
ality and construction of such lattices, both representing 
non-trivial choices, which can decisively influence the re- 
sult of such a comparison. 

Second, the addition of, the change in, or the deletion 
of only a few links — a likely consequence of a limited 
reliability of estimating links — can significantly change 
the value of average shortest path length L. In networks 
possessing latticelike topologies (large C and L values), 
uncertainties in link estimation can likely cause L to de- 
crease leading to small-world characteristics of the net- 
work under study (cf. Fig. 2 bottom). In the light 
of a limited reliability of time series analysis techniques 
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used to estimate links, multiple testing procedures can 
help control the probability of false positives in networks 
derived from field data^^. Whereas methods controlling 
the family- wise error (i.e., the probability of detecting 
false links among all possible pairs of nodes) have been 
developed over the years but are known to come along 
with a high risk of false negatives"^^ and thus missing 
links, methods controlling the false-discovery rate (i.e., 
the probability of false positives among all inferred links) 
appear to be promising approaches with lower risk of false 
negatives^^''^^'*°. However, limiting the probability of er- 
roneously adding, changing, or deleting just a few links 
calls for small probabilities of both, detecting false posi- 
tives as well as missing false negatives, which represents 
a challenging task for currently available multiple testing 
methods. In addition, hypothesis testing involves defin- 
ing appropriate null-models for time series, which again 
represents a challenging issue'^^'^^. 

Third, the placement of sensors to capture the dynam- 
ics of an unknown spatially extended complex system and 
their representation as nodes of an interaction network 
already represents an interpretation of the data which 
comes along with various pre-assumptions, e.g. the sys- 
tem can be meaningfully decomposed into subsystems 
and their dynamics is captured by the sensors. Missing 
to capture the dynamics of only a few of the subsystems 
may lead to a dramatic change in L as discussed above. 
In addition, capturing the dynamics of the same subsys- 
tem with two or more sensors together with commonly 
used time series analysis techniques to measure interac- 
tions can lead to an increase of the clustering coefficient 
C of the network (cf. Fig. 3). This may lead to the in- 
ference of small-world characteristics even for interaction 
networks actually possessing a random network topology. 
We have demonstrated (cf. Fig. 4) that time scries anal- 
ysis techniques that would be capable of unequivocally 
distinguishing between interdependencies due to common 
sources and interdependencies due to interacting dynam- 
ical subsystems can overcome the problem of an artificial 
increase of the clustering coefficient. In this context we 
mention two time series analysis techniques'^^''^^ which 
purport to be capable of distinguishing between such 
cases. It remains to be shown, however, whether these as 
well as other time series analysis techniques are capable 
of distinguishing between direct and indirect interactions, 
another potential mechanism for a spuriously increased 
clustering coefficient. An alternative approach toward 
tackling the problem of a spuriously increased cluster- 
ing coefficient is to manually correct C for the influence 
of spatially sampling the same dynamical aspect"*^. This 
approach, however, relies on a priori knowledge about the 
system which may not be generally available. A decom- 
position of systems - which actually represent spatial dif- 
fusion or field processes - into subsystems may introduce 
spatial correlations in the network topology and leads to 
the question as to how much the derived interaction net- 
works depend on the coarse graining of the dynamics. In 
this case, interaction networks may not describe proper- 



ties inherent to the studied dynamics but properties that 
solely depend on the applied coarse graining scheme. The 
value of such a description may vary and will depend on 
the application and aim of the study. 

Fourth, already in the first small- world model of Watts 
and Strogatz^ the wide regime of small-world networks 
is flanked by the special cases of random and lattice net- 
works. Together with the aforementioned influencing fac- 
tors, findings of small- world interaction networks in an 
absolute sense should be expected and are not surpris- 
ing. In contrast, when being able to eliminate all infiu- 
encing factors, findings of the special (and thus possi- 
bly rare) topologies of random or lattice interaction net- 
works would be surprising. Besides deciding upon small- 
world characteristics in an absolute sense, more recent 
studics^'''^'^'^^ aim at studying relative changes of C and 
L only. In view of the wide regime of small-world net- 
works, such an approach appears more promising. 



V. CONCLUSION 

We have identified an important constraint when ana- 
lyzing interaction networks derived from the dynamics of 
spatially extended systems, namely that solely the spa- 
tial sampling of the system together with a time scries 
analysis based link characterization can lead to indica- 
tions of a small-world topology. The arrangement of 
sensors commonly used in field studies along with the 
ansatz of considering a spatially extended dynamical sys- 
tem as a complex network of interacting subsystems im- 
poses a spatial structure on the system, irrespective of 
its actual organization, which may also underlie spatial 
restrictions. Given these constraints, it would be surpris- 
ing to not observe small-world indications for interaction 
networks derived from spatially extended dynamical sys- 
tems. Whether the actual interaction structure of these 
systems is indeed small-world or not — a question ad- 
dressed in various studies in the neurosciences, and the 
climate and earth sciences — cannot be unequivocally an- 
swered with currently employed analysis techniques. We 
therefore consider it advisable to avoid interpretations 
of findings of small- world topologies for such interaction 
networks in an absolute sense. 

Here we here restricted our investigations to widely 
used network characteristics, the clustering coefficient C 
and average distance between nodes L. It is conceivable 
though, that the abovementioned constraint will likely 
affect other network characteristics. To advance our un- 
derstanding of the dynamics of spatially extended sys- 
tems we consider the following directions of research as 
promising: (i) improving the determination of the actual 
structural organization of a dynamical system can help to 
advise the design of appropriate sensor placement strate- 
gies (such an approach is currently being pursued in the 
neurosciences^^); (ii) developing novel and refining exist- 
ing time series and network analysis tcchniques^^''^^'^^^^^ 
together with computational network analyscs^^^^^ can 
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help to unravel functional interactions from contributions 
that result from sampling the same subsystem. 
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